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Abstract: Rateless codes have been shown to be able to provide 
greater flexibility and efficiency than fixed-rate codes for multicast 
applications. In the following, we optimize rateless codes for un¬ 
equal error protection (UEP) for multimedia multicasting to a set 
of heterogeneous users. The proposed designs have the objectives of 
providing either guaranteed or best-effort quality of service (QoS). 
A randomly interleaved rateless encoder is proposed whereby users 
only need to decode symbols up to their own QoS level. The pro¬ 
posed coder is optimized based on measured transmission proper¬ 
ties of standardized raptor codes over wireless channels. It is shown 
that a guaranteed QoS problem formulation can be transformed 
into a convex optimization problem, yielding a globally optimal so¬ 
lution. Numerical results demonstrate that the proposed optimized 
random interleaved UEP rateless coder’s performance compares 
favorably with that of other recently proposed UEP rateless codes. 


Index Terms: multimedia, error control coding, unequal error pro¬ 
tection, raptor codes, video transmission 

I. Introduction 

Multimedia transmission is a main driver for explosive data 
traffic growth in wired and wireless networks. While decades 
of research have been conducted in designing reliable multime¬ 
dia transmission over error-prone channels, multimedia multi¬ 
cast over lossy packet networks is still challenging due highly 
variable channel conditions among different users, QoS con¬ 
straints and multimedia devices, e.g., smart phones, tablets, lap¬ 
tops. Scalable video coding (SVC) 0 is useful as it layers the 
source to enable efficient progressive reconstruction at the re¬ 
ceiver. 

Protection against channel impairments can be achieved by 
using codes that provide forward error correction (FEC): Reed- 
Solomon (4), low density parity check (LDPC) (5), Turbo (6J 
and fountain 13 0, as well as joint source-and-channel cod¬ 
ing (JSCC) 0 0 0. Other approaches that exploit source 
scalability to provide UEP use hybrid automatic repeat request 
or cross-layer optimization @ HU in. The above approaches, 
however, were mainly envisioned for point-to-point links and do 
not consider heterogeneous users’ QoS. As a result, adaptation 
of ISCC to multimedia multicast is often inefficient in that they 
cater to the lowest QoS user. 


A critical aspect of robust multimedia multicast is channel¬ 
coding performance. Traditional fixed-rate FEC encounters 
the problem of channel heterogeneity as in the case of Reed- 
Solomon (RS) codes that are targeted for one specific loss rate 
Q2. Rateless fountain codes US are efficient and flexible for 
broadcasting or multicasting over erasure channels. The rate¬ 
less property enables (1) a transmitter to generate, as needed, 
an unlimited number of encoded symbols, and (2) a receiver 
to successfully recover any subset of the encoded symbols of 
size slightly greater than the number of information symbols. 
Raptor codes fl4j due to their high performance and low com¬ 
plexity are fountain codes that have been incorporated into 
the third generation partnership program (3GPP) Multimedia 
Broadcast/Multicast Services (MBMS) standard Ifl5ll . In Ifl6l , 
raptor codes have been extensively evaluated for MBMS down¬ 
load delivery. A more recent version appears in fT3 , and back¬ 
ground can be found in (181 . Rateless codes have been ap¬ 
plied to SVC-based multi-source streaming ED, adaptive uni¬ 
cast streaming 0, and SVC streaming from multiple servers 
l20l . A JSCC rateless coding framework for scalable video 
broadcast appears in ETTl . Applications to distributed video 
streaming for relay/cooperation based receiver-driven layered 
multicasting is found in (22], while |23| and (24l use fountain 
codes for distributed video caching via user cooperation. 

While the raptor code itself is not suited for progressive de¬ 
coding, multimedia has a hierarchical source symbol priority 
structure necessitating unequal error protection (UEP), some¬ 
times referred to as priority encoding transmission (PET) l25l . 
Numerous UEP approaches to multimedia transmission have 
been proposed 0 l26ll (27l l28l . In 0, Mohr proposes a PET- 
based packetization scheme for transmitting compressed images 
over noisy channels. In (26]. the Mohr scheme is optimized to 
minimize end-to-end distortion. Optimization of receiver-driven 
networks has also been investigated |29l . Rate-distortion-based 
optimization can be found in (30l . Rather than incorporate 
code performance into the optimization, these existing optimiza¬ 
tion approaches generally employ maximum distance separable 
(MDS) codes. In this paper, code performance is taken into ac¬ 
count in the UEP rateless code optimization. 

Not surprisingly, UEP rateless code design methods have re¬ 
cently appeared. In (311 . message symbols are encoded by non- 
uniform selection of source symbols and applied to MPEG-II 
video transmission in (32l . In (33], expanding window fountain 
(EWF) codes organize source symbols into a sequence of nested 
windows. In (34], EWF codes are applied to scalable video mul¬ 
ticasting. Windowing approaches for rateless codes that achieve 
equal error protection (EEP) have been proposed in (35l and 
(36l . The sliding-window (SW) rateless code design proposed 
in (36l is applied to wireless video broadcasting in (33 . Ah- 
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mad et. al f38l achieve UEP in video multicast using the Luby 
Transform (LT) ll39l via block duplication. In l40l , a UEP rate¬ 
less code based on hierarchical graph coding is proposed for 
media streaming. However, these previous UEP rateless code 
design approaches M ED may compromise performance as 
they alter the LT code ll39l degree distribution unless the degree 
distribution is jointly-optimized with UEP parameters. 

Finally, previous approaches to UEP optimization for mul¬ 
timedia have focused almost exclusively on providing best- 
effort QoS, i.e., maximization of an average fidelity measure of 
video/image quality of end users for a given transmission rate 
0 l26l l30l or with rateless codes 1211 l32l l34l l37l . As 
rateless codes have no pre-determined transmission rate, QoS 
may be achieved by transmitting enough coded symbols to meet 
users’ QoS demands. In contrast, our focus is on guaranteed 
QoS optimization, i.e., minimizing resource usage under the 
constraints of heterogeneous QoS guarantees. While fl] and 
Cl presented early versions of this approach, this paper pro¬ 
vides more complete background, technical detail, a method to 
simplify constraints, as well as an example video multicasting 
application. 

Our main contributions are summarized as follows: 

1. a UEP scheme is proposed that uses random interleaving 
of raptor coders that enables direct application of already op¬ 
timized standardized raptor codes used for 3GPP MBMS 8131 . 
When applied to multicasting to heterogenous users, low band¬ 
width clients need not receive encoded symbols targeted to high 
bandwidth clients, which can significantly reduce receiver com¬ 
plexity and time to decode. 

2. the proposed design, optimized for multimedia multicast to 
heterogeneous users, contains QoS guarantees and factors in 
rateless code performance. With standardized raptor codes, this 
guaranteed QoS optimization problem is shown to be convex 
with a simplified solution using Karush-Kuhn-Tucker (KKT) 
optimality conditions. 

3. through a combination of simulation and analysis, perfor¬ 
mance of the proposed random interleaved UEP rateless design 
is compared to other EEP and UEP rateless coders. 

The paper is organized as follows: Section HT1 describes the 
system setup and proposed UEP rateless code design; SectionHIIl 
presents the problem formulations for guaranteed and best-effort 
QoS; SectionllVlprovides the solution for guaranteed QoS. Sec¬ 
tion [TV] transforms the original problem formulation for guaran¬ 
teed QoS into a convex optimization problem where optimal se¬ 
lection probabilities for interleaving are obtained in closed form 
for certain cases or else numerically. Comparisons with recent 
UEP rateless coding schemes are provided in SectionlVI 

II. System setup and proposed design 
A. System setup 

A multimedia server that transmits multimedia content simul¬ 
taneously to multiple users is considered, which may include 
streaming with strict delay requirements. Multimedia content is 
divided into multiple coded blocks. The server first compresses 
each source block using a pre-defmed source coder and then 
adds error protection to the source information using a rateless, 
e.g., raptor or LT code. Encoded symbols are then multicast over 
a wireless lossy packet network. 


User subscribers are classified into J classes according to re¬ 
ception capability. For Class j users, reception capability Sj is 
defined as the proportion of symbols that the receiver can suc¬ 
cessfully receive compared to the number of transmitted sym¬ 
bols, 1 < j < J. Therefore, in each transmission session, the 
number of successfully received encoded symbols for each user 
in Class j is SjM, where M is the number of symbols transmit- 
tecQ. Without loss of generality, we order the classes according 
to reception quality, i.e., 0 < <5i < A 2 - - - < Sj < 1. For example, 
a Class 1 user may represent a mobile cell phone with limited re¬ 
ception quality due to size and power restrictions, while a Class 
2 user may represent an automobile equipped with larger an¬ 
tenna and higher bandwidth service. Reception capabilities are 
determined by channel quality and bandwidth between server 
and receiver, and no distinction is made between overhead in¬ 
curred by symbol erasures and lost symbol transmission oppor¬ 
tunities due to client bandwidth restrictions. Users in different 
classes may also have different QoS requirements, and outage 
QoS guarantees are used to enable users to recover a given por¬ 
tion of source data with an achieved target probability. Without 
loss of generality, the term peak signal-to-noise ratio (PSNR), 
a common measure used for visual media quality, is used to de¬ 
note QoS. 

Let K represent the number of information symbols in a 
raptor-coded source block. Assume the server transmits M = 
(1 + e)K encoded symbols in order to meet all users’ QoS de¬ 
mands, where e is the total transmission overhead for all layers 
needed to combat losses of the heterogenous users in the mul¬ 
ticast system. For scalability, the coded source block is parti¬ 
tioned into L layers in decreasing order of importance: Layer 1 
contains the most important symbols while Layer L contains the 
least important symbols. For example, in video or image com¬ 
pression terminology. Layer 1 might represent the base layer 
(BL), and Layer 2 the first enhancement layer (EL). The num¬ 
ber of source symbols in Layer l is denoted by Si , 1 < l < L, 
and K = ]Uf =1 Si. 

Successful decoding of layer l requires layers 1, 2,..., L — 1 
to be decodable. Rather than jointly optimizing the source and 
channel coders, we focus on optimizing channel coding param¬ 
eters for a given source coder. Therefore, we assume that the 
values of S';, 1 < l < L are provided by a pre-determined scal¬ 
able source coder. 

B. Proposed UEP rateless code 

We propose a randomly interleaved UEP rateless encoder 
structure to provide FEC for multimedia multicast as shown in 
Fig. Q] The encoder assumes that source symbols have been 
allocated to the L layers prior to encoding. Encoding is per¬ 
formed by randomly selecting layer l with probability pi for 
( = 1,2,..., L where 1 Pi = 1- Encoded output symbols are 
generated by the raptor encoder for Layer l with code dimen¬ 
sion Si, degree distribution Cli(x) and precode C'i . The overall 
encoded data stream consists of interleaved raptor-encoded sym¬ 
bols from the L encoders. From the above definitions, e can be 

1 Lor analytical simplicity, the number of received symbols for each user class 
is modeled as Sj multiplied by the total transmitted as in (SJ. 
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Fig. 1. The proposed random interleaved UEP raptor coding method. 


lower bounded by 

1 cL 

Gni„ = K HCSeVfc-h (1) 

i= 1 

where (Sel)^ £ {Si, S 2 ,••• ,Sl} denotes the code dimension 
of the selected layer for user Class i. 

The proposed rateless coded scheme uses the random inter¬ 
leaving to achieve UEP. While probabilistic encoding has been 
used in EWF rateless codes in lf34l . as well as in ED and |42l . 
an advantage of the proposed scheme in Fig. |T|is that the differ¬ 
ent layers can be encoded and decoded separately. In addition, 
in OTfl and (34), degree distributions and selection probabilities 
need to be optimized jointly, which is a complicated task. A 
practical advantage of Fig. |T|when applied to a multicast system 
for users with different bandwidth constraints, low bandwidth 
clients need not receive symbols generated from source layers 
targeting high bandwidth clients, which reduces the complexity 
and time-to-decode for low BW clients. 

It is worth noting that one may alter the ordering of the out¬ 
put symbols from the random interleaved UEP raptor coder us¬ 
ing scheduling algorithms while maintaining the priority of each 
layer. Investigations along these lines have been recently pro¬ 
posed in (43) and ll44l . Unlike (45) (46) . the proposed need not 
specify a packetization structure; the scheme may be applied to 
data packets rather than to symbols. 

III. Problem formulations with QoS constraints 
A. Guaranteed QoS formulation 

We consider users that require playback media at a quality 
no lower than their own QoS requirement. Since the transmit¬ 
ter has to provide guaranteed QoS for all user classes before 
the start of transmission of the next source block, system de¬ 
lay and throughput for each source block is determined by the 
maximum number of transmitted symbols required to satisfy the 
QoS of each individual user class. As delay is a critical issue in 
multimedia multicast, the objective is to provide different lev¬ 
els of QoS guarantees according to users’ requirements while 
minimizing total transmission overhead e: 

Problem 1.0 (Guaranteed QoS): 


s.t. Prob (PSNRj > 7i ) > P h j = 1,2,..., J, (3) 

where PSNRj represents the PSNR of the successfully recov¬ 
ered source data of the Class j user given M = (1 + e)K trans¬ 
mitted symbols, and jj and 1 — Pj denote target PSNR threshold 
and outage probability, respectively, for the Class j user. The 
aim is to allocate coding rates across layers through optimiza¬ 
tion of the probabilities pi, 1 < l < L. 

The source (e.g., video, image) coder is assumed to be pro¬ 
gressive, so that the reconstruction media quality is determined 
mainly by the symbol errors in the lowest layer encountered in 
the recovery process. Let qi,l = 1,2, ..., L, represent the PSNR 
achieved when Layers 1 to l are successfully recovered by rap¬ 
tor decoding, where q\ < q- 2 --- < (//,. For a given source coder, 
if the source PSNR is represented as non-decreasing function, 
/(•), of the total number of source symbols decoded by the re¬ 
ceiver, then qi = f(^f ll Si). For each class 1 < j < J, let 
gj £ {1,2,..., L} be the minimum index that satisfies q gj > 7 j. 
In order to satisfy PSNR gj > 7 j, users in Class j require the 
raptor decoder to successfully decode, at minimum. Layers 1 to 
g :i . For a given UEP raptor code design, let P e (l,j) represent the 
error probability that the Class j decoder fails to decode layer l 
given transmission overhead e and reception quality Sj. In the 
most stringent case when decoding errors across layers are in¬ 
dependent, QoS requirements of end users can be simplified to 

9j 

H(i-p e (i,j))>Pj j= i, 2 ,..., j. (4) 

i=i 

B. Best-effort QoS formulations 

While the above formulation focuses on minimizing trans¬ 
mission overhead subject to satisfying guaranteed user QoS, 
this subsection considers transmission overhead that is upper 
bounded due to delay constraints or cost. For this scenario, 
given a maximum transmission overhead, e max , the service 
provider attempts to provide users of different classes with the 
best possible QoS. The following best-effort QoS problem ex¬ 
tends that in (34l by 1) considering both constrained and un¬ 
constrained cases, 2) allowing for allocating different weight¬ 
ing factors to different user classes as well as 3) possessing the 
previously mentioned advantages of the proposed random inter¬ 
leaved UEP raptor codes. 

The expected PSNR of users in Class j, which serves 
as a measure of the best-effort QoS, can be evaluated as 
E(PSNRj) = 'fld—iPipli, where qi is the PSNR achieved 
when Layers 1 to l are successfully recovered, where pij rep¬ 
resents the probability that a Class j user successfully recovers 
Layers 1 to l but fails to recover Layer l + 1. The optimization 
balances users of different classes with different channel qual¬ 
ities by assigning weighting coefficient Wj for Class j where 
0 < Wj < 1, i w j = 1- The choice of Wj depends on both 
the user class importance as well as the number of users in that 
class. The weighted average PSNR over all user classes then 
becomes the objective function: 

Problem 2: (Best-effort QoS) 

J / L 

1 ■« 

i=l \l =l 


mm 

Pi v iPL 


£ 


(2) 


max 

Pl ,P2,-”)PL 


( 5 ) 
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Subject tO £ < £max (6) 

where 

Pe{l + 1,3) nU(l - Pe(i,j)) 1 = 1, 2 ,L - 1 
nf=l(l -Pe(i,j)) l=L. 

(7) 

In Problem 2, no guaranteed minimum QoS is provided. For 
a given maximum transmission overhead, the service provider 
may instead aim to provide best-effort QoS to multiple user 
classes, but under the additional constraint of a minimum QoS 
guarantee for each user class: 

Problem 3: (Best-effort QoS with constraints on individual 
classes) 


One way to improve code performance for layers with fewer 
symbols is to merge source layers with similar optimized selec¬ 
tion probabilities p 3 into larger layers. However, for video the 
condition k < 200 is unlikely to occur. 

We also remark that standardized raptor codes outperform 
the recently proposed SW-raptor codes (37). For example, ac¬ 
cording to Fig. 2(b) of 137) . the SW-raptor codes have a de¬ 
coding failure probability of almost 100% with code dimension 
K = 5000 and overhead e = 0.03 while standardized raptor 
codes have negligible decoding failure probability at the same 
code dimension and lower overhead s = 0.01 according to CD. 

When more general LT or raptor codes using iterative decod¬ 
ing are employed, the decoding failure probability P e (l,j) can 
be approximated by assuming that symbol errors in iterative de¬ 
coding are mutually independent 


j / l \ 

max ^Wj ( ^Pij -qi 

Pl,p2,---,PL Z ' \ Z ' 

j= i \i=i / 


subject to 


F. < 


( 8 ) 

(9) 


9j 

and ]J(1-P e (l,j))> Pj j = 1,2,..., J (10) 

l=i 

where pij is given by © and g 3 g{1,2,...,L} (j = 1,2, ...J) 
is the minimum layer index that satisfies q g . > 7 3 . Problem 2 is 
a special case of Problem 3 without user QoS constraints. 

In the next section, we show that Problem 1.0 can be trans¬ 
formed to an equivalent convex optimization problem when 
standardized raptor codes are employed. Unfortunately, while 
Problems 2 and 3 cannot be similarly transformed due to the 
form of the pij expressions, they can still be solved numeri¬ 
cally by searching the (L — l)-dimensional parameter space of 
{pi, P 2 , ..., pl-i}, checking the constraints (TTOb and the result¬ 
ing average PSNR ®. When L = 2, the numerical method is 
significantly simplified as only p\ £ [0,1] that gives the maxi¬ 
mum average PSNR needs to be determined. Numerical results 
and comparisons for Problems 2 and 3 are provided later. 

IV. Solving the guaranteed QoS problem 
A. Evaluation of decoding failure probability 


Pe(l,j) = 1 — (1 — ei,j) S S (12) 

where eij is the symbol error probability of a Class j user de¬ 
coding Layer l (also see (3) of (34)) which can be analytically 
determined by and-or tree analysis 07). Since each layer is en¬ 
coded by a separate rateless code, evaluating the symbol error 
probability of each layer can be consider as a special case of 
(6) and (7) in ED where uniform selection is used (Um = 1 in 
ED), and Eq. (ITIl i can be approximated using 


1 


e U = 


exp (—^Q'(l-e^)) 


s, 




n = 0 
n > 1 


(13) 


where fl(.) is the LT code degree distribution, O'(.x') denotes 
derivative with respect to x, n is the number of decoding itera¬ 
tions and ti is the total number of encoded symbols transmitted 
for Layer l in each transmission block. The asymptotic symbol 
error probability e; j = lim^^oo e” • of iterative decoding can 
be estimated by choosing a large value n in Eq. ( 1 13 t i (see E3). 


B. Convexity analysis 

For a given transmission overhead e, ti = (1 + s)L p/,EI and 
satisfies Y^i =1 ti = ( 1 + e)K. When standardized raptor codes 
are used, substituting to = tiSj and k = Si into Eqs. (ITTI) and 
©. and taking the logarithm of the constraints described by ©. 
Problem 1.0 is transformed to: 


In the proposed design, existing high-performance standard¬ 
ized raptor codes can be directly applied, which enable low 
encoding/decoding complexity and overhead. Details about 
the pre-code, degree distribution and code construction can be 
found in lfl5) . (Annex B). When standardized raptor codes are 
employed with maximum likelihood (ML) decoding for code di¬ 
mension greater than 200, the decoding failure probability, i.e., 
failure to decode k source symbols after to symbols are success¬ 
fully received, have been shown, through extensive experimen¬ 
tation, to be accurately modeled by fl6l . 


PI (to, k ) 


1 if to < k 

ab m - k if to > k 


( 11 ) 


where constants a = 0.85, b = 0.567. For k < 200, Eq. (ITO 
underestimates the error probability due to short block length. 


Problem 1.1: 


min Sf =1 fi (14) 

s -t- -^fiilogl 1- ] + log Pj < 0 , j = 1, 2 ,..., J, (15) 

where c; = ab~ Sl , ctj = b Sj and g 3 £ {1, 2,..., L}. The con- 
straint that ti is non-negative is implicitly guaranteed by the 
log(.) function. To ensure an integer solution, we compute 
ti, £ 2 ? •••? as if real-valued, then round to the nearest larger 

2 Strictly speaking, ti is a Binomial-distributed random variable with mean 
(1 + e)Kpi. However, the randomization of ti has little effect on the problem 
of interest when averaged over a large number of realizations. In addition, one 
can always schedule the selection of layers to make sure that ti is proportional 
to p x . 
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integer. Although the above transformation uses the decoding 
failure probability evaluation of standardized raptor codes given 
by Eq. (mi . a similar method can be applied to other decod¬ 
ing failure probability models that can be approximated by an 
exponential function. 

To solve Problem 1.1, we first prove convexity. As the ob¬ 
jective function is linear, we only need to prove that the con¬ 
straint functions are convex with respect to ti, l = 1, 2,..., L. It 
can be shown that for l = 1,2,..., L, the second derivatives of 
— log(l — cia * 1 ) with respect to ti satisfy 


According to the second order condition of convex functions 
l48l . — log[l — qq;* 1 ] is a convex function of ti. Since 
nonnegative weighted sums preserve convexity (48) , the con¬ 
straint functions 03 are convex functions of the vector t = 
[t i, 1 2 , ...,tL] T . Problem 1.1 can therefore be solved numeri¬ 
cally by available convex optimization algorithms (48). We re¬ 
mark that the above convexity holds not just for values of a and 
b in the exponential model of Eq. CD from ffiD but also more 
generally over the range 0 < a < 1 and 0 < b < 1 which rep¬ 
resent a wide family of exponential fountain code failure proba¬ 
bility models. 

Let t = [ti,t 2 , ...,t L ] T and A = [Ai, A 2 ,Aj] T be 
the variable vectors of the primal and dual problems of Prob¬ 
lem 1.1, respectively. If t* = [t^tj, ...,tL] T and A* = 
[Ai, A£, A}] T represent sets of primal and dual optimal 
points, they must satisfy the Karush-Kuhn-Tucker (KKT) opti¬ 
mality conditions for the objective function /o(.) and constraint 
functions fj (.): 


/,(t*)<0, j = 1, 2,..., J 

(17) 

A* > 0, j = 1,2,..., J 

(18) 

A*/,(t*) = 0, j = 1,2,..., J 

(19) 

J 


V/o(t*) + 5>j*Vfj(t*)=0 

(20) 


j=i 


where here /o(t) = SjCjtq and fj(t) = —Sfj =1 log[l — 
Cia j l ] +l°gPj' j = 1; 2,..., J. Since the original Problem 1.1 is 
convex and satisfies Slater’s condition, the above KKT optimal¬ 
ity conditions provide the necessary and sufficient conditions 
for optimality (48) . In general, solving the KKT condition is not 
straightforward. However, if we can identify a set of inequality 
constraints that are most likely to be active, i.e., achieve equality 
at the optimal solution, then we can obtain a corresponding set 
of primal and dual solution points and verify the optimality with 
KKT condition. 

A simplification to Problem 1.1 arises if we have a one-to-one 
mapping between user classes and channel coding layers, i.e., 
9j = j f° r j = 1,2,.., J and L = J, which is the assumption 


used in the formulation of (34), and if all the inequality con¬ 
straints are active. Using the above assumption, the solution to 
Problem 1.1 can be obtained by finding t\ using the constraint 
for Class 1 in Eq. (fl5T> and substituting the solution of t \ into 
the next constraint, solving for £ 2 with the constraint for Class 2 
in Eq. (ful l etc. until all of the variables t \, i 2 , ...,£/. are deter¬ 
mined. However, since this simplification has not been proven 
to be equivalent to Problem 1.1 in general, the solution obtained 
in this manner has to be verified using the KKT optimality con¬ 
ditions. If all the inequality constraints are active, Eqs. ([m and 
m are automatically satisfied. Therefore, if we obtain a solu¬ 
tion t* of Problem 1.1 by solving fj( t*) = 0,j = 1, 2,..., J, 
we can substitute the value of t* into Eq. (l20t and obtain A*. If 
A* satisfies Eq. ( fl8l >. i.e.. A* > 0 ,j = 1,2,..., J, then we have 
proven that the value of t* we obtained is indeed an optimal so¬ 
lution of Problem 1.1. If the KKT optimality condition is not 
satisfied, then numerical methods can still be used to solve this 
convex optimization problem. 

C. Class-to-layer mapping algorithm 

In the following, we propose an algorithm to transform a gen¬ 
eral guaranteed QoS problem into a problem with one-to-one 
mapping between user classes and channel coding layers. The 
idea is to reduce the dimensionality of the problem by removing 
redundant user constraints and merging source-coding layers. 
The process is explained in the following algorithm: 

Algorithm 1: (Class-to-layer mapping algorithm) 

Step 1 (User class amalgamation): Repeat the following class 
amalgamation operation until gi < gi, for every i < k, where 
1 < i < ./, 1 < k < ./: for any pair of user class indices i and k 
where i < k (hence <5, < 6k), if Class i users have the same or 
higher target PSNR threshold than Class k users (i.e., 7, > 7*. 
or gi > gk), we absorb Class k into Class i. 

Step 2 (Source layer merging): Repeat until for every layer 
1 < l < L, there exists a class j, 1 <j< J such that gj = l: 
if there exists a source layer l where there is no corresponding 
user class (i.e., no j exists such that gj = l). Layers l and l + 1 
are merged to form a new source layer V with code dimension 

St' = Si + S l+ i. 

Step 1 finds a set of the most demanding user classes with 
respect to their channel conditions; Step 2 reduces the number 
of channel coding layers to the minimum without compromising 
the performance. After performing Algorithm 1, we can show 
the following fact: 

Lemma 1: After performing Algorithm 1, L = J and gj = j 
for j = 1,2,..., J. If for every Class k that has been absorbed 
into Class i in Step 1, P, > I\. is also satisfied, then the new op¬ 
timization problem after performing Algorithm 1 is equivalent 
to Problem 1.1. In addition, any further partitioning of layers 
cannot reduce the minimum transmission overhead required to 
achieve the QoS requirements. 

Proof: First we show that any QoS constraint dropped from 
Step 1 (user class amalgamation) is irrelevant. Suppose the QoS 
constraint of Class i users is satisfied, i.e., nfi^l 
Pi. Since i < k, we have Si < 6k- Hence, Class k users receive 
more coded symbols than Class i users. Therefore, the decoding 
failure probability P e (l, i) > P e (l, k) for all 1 < l < L. Then, 


<9 2 [-log(l-Cia‘‘)] _ Clot'- (log o.j'f' 
dtf (1 -c z a‘.‘) 2 


>0 j = (16) 
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because gi > gk, from the assumption of Lemma l. Pi > Pk, 
and 


9 k 

11(1 -P e (l,k)) > 
1=1 

> 


lb 1 -AM) 

1=1 

9i 

j[[(l - P e (M)) > A > A-(21) 


1=1 


Hence, the QoS constraint for Class k users is also satisfied. 

Next we show that after performing Algorithm 1, the number 
of source layers L and the number of user classes J are equal. 
The class amalgamation procedure ensures that the set g :l , j = 
1,2,..., J is monotonically increasing with j. This fact does 
not change after performing the source layer merging procedure. 
Since gj e {1,2,..., L}, we have L > J. On the other hand, 
source layer merging ensures that for any l = 1, 2, ...L, there 
exists an integer j £ {1, 2,..., J} such that g 7 = l. Therefore, 
we also have L < J. Thus, L = J. Together with the fact 
that gj is monotonically increasing with j, we can conclude that 
9 j =3 for .7 = 1,2,..., J. 

Finally, to complete the proof, in the appendix we show that 
any further partitioning of layers cannot reduce the required 
minimum transmission overhead. QED. 

Remark 1: The condition that for every Class k that has been 
absorbed into Class i, Pi > Pk, is a sufficient condition for 
Lemma 1 but not a necessary condition. Even if this condition 
is not satisfied, it is possible that the transformed problem due 
to Algorithm 1 results in the optimal solution. In addition, if 
this condition is violated, to ensure that the optimal solution of 
the transformed problem is the optimal solution of the original 
problem, we can always verify if the obtained solution satisfies 
all the constraints of the user classes that have been amalga¬ 
mated in Step 1. If not, the convex Problem 1.1 can be solved 
numerically. This is further illustrated in Section HV-DI 

Remark 2: For best-effort QoS Problem 3, the transformation 
given in Algorithm 1 may not apply, as an optimal solution also 
depends on the fidelity measure of the multimedia source. 

Remark 3: In the original general problem L and J are arbi¬ 
trary, which means it is possible that a user with worse channel 
quality may have a higher QoS requirement. Lemma 1 and the 
mapping algorithm transform the original problem to a progres¬ 
sive transmission problem where there is a one-to-one mapping 
between user classes and channel coding layers. 

Remark 4: In the case of L = J, the transmission overhead 
can be lower bounded by Eq. <|H) which is independent of code 
optimization. Minimizing e, as in Problems 1 and 1.1, maxi¬ 
mizes code performance. 


D. Video multicasting numerical example 

We now illustrate the mapping process and solution to the 
guaranteed QoS problem for multicasting a H.264 SVC 0 
video-coded stream which contains 15 layers: a base layer (BL) 
and 14 enhancement layers (ELs). Since our focus is on opti¬ 
mizing a channel coder for a given source coder, the number 
of information symbols and the corresponding PSNR values are 
taken from Table I of (34). As in f34l . each source symbol repre¬ 
sents 400 source bits. The UEP rateless encoders and decoders 


operate at the symbol level. We assume there are four classes of 
users with reception capabilities and QoS requirements shown 
in Table Q] 


Table 1. Example of user classes and their QoS requirements 


User class index (j) 

1 

2 

3 

4 

User reception capability Sj 

0.4 

0.5 

0.6 

1 

User QoS req. (PSNR thr. -y^ (dB)) 

25.79 

29 

27.25 

40.28 

# Decoded symbols to achieve QoS 

400 

1155 

700 

3800 

# Decoded source layers required (gj) 

1 

4 

2 

15 

Probability threshold Pi 

0.8 

0.9 

0.85 

0.95 


Using the previously described simplification strategy for 

class and layer mapping, we observe that t /2 > [h while 82 < S 3 , 

which means that Class 3 users have both better reception capa¬ 
bilities and lower PSNR requirements than Class 2 users. There¬ 
fore, the QoS constraint from Class 3 users can be dropped. 
Then, since the number of layers required by the three classes 
are 1, 4 and 15, after the layer-merging procedure of Algorithm 
1, we obtain a new set of channel layers with Layer 1 compris¬ 
ing the BL, Layer 2 consisting of the first 3 ELs, and Layer 3 
consisting of the fourth to fourteenth ELs. Since P 2 > P 3 , from 
Lemma 1, the new problem after mapping is equivalent to the 
original problem. The parameters of the transformed problem 
after the mapping are shown in Table [2] 


Table 2. User classes and QoS requirements after the mapping. 


Combined class-layer index (j or 1) 

i 

2 

3 

Reception capability Sj 

0.4 

0.5 

1 

PSNR threshold 7 ^(dB) 

25.79 

29 

40.28 

Number of decoded symbols to achieve QoS 

400 

1155 

3800 

Number of decoded layers required gj = j 

1 

2 

3 

Probability threshold Pj 

0.8 

0.9 

0.95 

Number of symbols Si in each layer 

400 

755 

2645 


To determine the interleaving probabilities for the standard¬ 
ized raptor codes for the three new layers, p\, p 2 and p :s need to 
be determined to minimize L + f -2 + £3 such that 

f (1 — a 6 ^ 1<5l_Sl ^) > Pi 
i (1 - ab^ 62 ~ s i))(l - ab^~ s ^) > P 2 

[(1 a 5( ti < 5 3-Si))(i _ ab (t 2<53-s , 2 ))( 1 _ ab (t 3 s 3 -s 3)) > p 3 

Assuming all the inequality constraints are active, we obtain 
a minimum overhead e m i n = 36.2%, which is achieved when 
pi = 0.1946, p 2 = 0.2933 and P 3 = 0.5121. The solution 
is then verified to be optimal using KKT conditions. In con¬ 
trast, equal error protection (EEP) allocation requires a mini¬ 
mum overhead of 152%, a factor of over four higher. 

With the optimal selection parameters, we find that Class 3 
users of the original problem (Table Q} can successfully decode 
the base layer and one enhancement layer with a probability 
higher than 99.9%. This means that even if the target probability 
threshold P 3 = 99% in Table [Q which violates the assumption 
of Lemma 1, the problem transformed by Algorithm 1 still has 
the same optimal solution as the original problem. As a further 
remark, let us suppose that the conditions of Lemma 1 were vi¬ 
olated, and we assume the extreme case of P 3 = 99% and vary 
the value of £3 within the range 0.5 = 82 < S 3 < 1 = 83 . In that 
case, only when 0.5 < 83 < 0.503, our obtained solution does 
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Fig. 2. UEP versus EEP for varying S 1 /S 2 , standardized raptor codes, 
K = 1155; <5= [0.5, 0.9]; P = [0.95,0.9]. 



Fig. 3. UEP versus EEP for varying 8 ^/ 82 , standardized raptor codes, 
S = [400, 755]; 82 = 0.9; P = [0.95,0.9]. 


not satisfy the QoS constraint of Class 3 users. In practice, how¬ 
ever, distinct classes would have a greater reception capability 
difference than x 100% = 0.6%. 

0.500 

V. Numerical and simulation results 

This section provides comparisons of the proposed random 
interleaved UEP rateless code design to EEP codes and to other 
recent UEP rateless codes. The parameters of the different sce¬ 
narios are described in the corresponding figure captions. Per¬ 
formance of LT codes are evaluated using and-or tree analysis 
while standardized raptor codes are evaluated using Eq. m. 
Simulations are also used to confirm the and-or tree analysis. 

Figs. U to [4] comp are the proposed UEP design to EEP design 
for the guaranteed QoS problem when standardized raptor 
codes are employed. The minimum transmission overhead is 
evaluated using the method described in Section lTV-DI For sim¬ 
plicity, only two layers are considered. The dimension of the 
standardized raptor code used in layer 2 is Si. The inefficiencies 
incurred by the standardized raptor codes are characterized by 
the decoding failure probability PJ (.) in Eq. (fTTb and are small 
as expected. The optimal selection probability p\ and minimum 
overhead for the UEP scheme are obtained by the simplified 
method described in Section HV1 for solving Problem 1.1, i.e., 



Fig. 4. The effect of layer allocation probability pi , standardized raptor 
codes, L = 2, K = 1155; S = [400,755]6 = [0.5,0.9]; P = [0.95,0.9]. 


by assuming that all inequality constraints are active. All results 
shown in Figs. [2]to[4]were verified to satisfy the KKT optimality 
conditions. To achieve EEP, the ratio P 1 /P 2 is fixed to .S'i / S 2 - 
Fig. [2] shows minimum transmission overhead, e, required for 
optimized UEP and EEP raptor codes as the ratio between the 
numbers of bits in the two layers is varied. Fig. [^compares UEP 
and EEP as a function of channel reception quality of the first 
user class, <5i. It can be seen that UEP has a significant advan¬ 
tage over EEP whenever the channel reception qualities of the 
two classes differ appreciably. Fig. [4]plots minimum transmis¬ 
sion overhead, e, as a function of selection probability p\ where 
it is observed that e is very sensitive to the choice of p\. In par¬ 
ticular, a non-optimized allocation scheme may be significantly 
outperformed by EEP. 

To enable comparison with the UEP raptor code from ED 
as well as with EWF codes from [l34l . rather than use raptor 
codes, we employ iteratively decoded LT codes that have degree 
distribution m 

Q r (x) = 0.007969a; + 0.493570a; 2 + 0.166622x 3 
+0.072646a; 4 + 0.082558a; 5 + 0.056058a; 8 + 0.037229a; 9 

+0.055590a; 19 + 0.025023a; 65 + 0.003 1 35a: 66 (23) 

for all layers as used for UEP codes in [31] and for the EWF 
code |]34l . That is, for analytical simplicity, no pre-code is used 
in any of the schemes. The decoding failure probability P e on 
the left side of the constraint functions in Eq. (01 is evaluated as 
follows: the symbol error probability ei of Layer l for the UEP 
rateless codes in 1311 , the EWF code, and the proposed random 
interleaved scheme are estimated by and-or tree analysis and 
obtained using Eqs. (6) and (7) in OTI . Eq. (7) in lf33l . and 
Eq. (IT3l > in this paper, respectively. The failure probability of 
decoding each layer is estimated as P e (l) = 1 — (1 — ei) Sl . 

Parameter optimization of the other schemes can be found 
in [5] and are not reproduced here. Fig. 5 in O provides the 
minimum transmission overhead required to satisfy all the user 
constraints of the proposed random interleaved scheme as well 
as that in ED using different values of kM, a parameter that 
governs the degree of non-uniformity of input symbol selection. 
Fig. 6 in (2) shows a similar comparison between the proposed 
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scheme and the EWF code. The size of the first window in the 
EWF code is fixed to the number of symbols in Layer 1 (Si). 
Parameter Ei is the probability of choosing the more impor¬ 
tant first layer during encoding (see l34l ). It can be observed 
that when all schemes are optimized, the proposed random in¬ 
terleaved rateless code performance matches that of 1311 as well 
as l33l . The existence of two local minima in Fig. 6 in 0 
is due to the symbol error rates of the more important bits not 
decreasing mono tonic ally as Ti increases (see Fig. 1 in j34j). 

An advantage of EWF codes f33l over those in f3T1l is flex¬ 
ibility in deploying different degree distributions for different 
windows. Fig. [5] plots transmission overhead as a function of 
numbers of symbols in the first layer or window for the three 
UEP schemes each using LT codes after optimization over their 
respective parameters where different degree distributions are 
applied to different EWF code windows as well as to different 
layers of the proposed UEP scheme. Degree distributions, cho- 



Fig. 5. Performance comparisons using LT codes with different degree 
distributions with L = 2; K = 9000; 5 = [0.4,0.8]; P = [0.95, 0.8]. Also 
shown are EWF from ED and non-uniform selection from ED. 

sen for the more important bits (MIB) and less important bits 
(LIB), are denoted as f2i(x) and fQfa:;), respectively. Degree 
distribution ft r (x) described by Eq. (1231) is used as well as a 
truncated robust soliton distribution (RSD) Cl rs (k rs , <5, c), where 
k rs is the maximum degree, is applied to the MIB for the EWF 
and proposed random interleaved schemes. The truncated RSD 
has better error performance compared to f l r ( x) at the cost of 
higher decoding complexity. It can be seen from Fig. [5]that the 
truncated RSD for the MIB provides a significant performance 
boost for both schemes. When the same degree distributions 
are used, the proposed random interleaved scheme matches the 
performance of existing schemes. 

We note that in Fig. [4] £ m ; n = 0.432, indicating that the code 
performance nearly achieves minimum overhead while the code 
performance shown in Fig. [3] does not come close to the min¬ 
imum overhead. This difference is mainly attributable to the 
use of standardized raptor codes, which includes a high perfor¬ 
mance pre-code as well as efficient maximum likelihood (ML) 
decoding in contrast to the iterative decoding used for Fig. 0 
It can be argued that the performance of existing UEP designs 
in OTl and l(33ll can similarly benefit from a precode and ML 
decoding. However, ML decoding complexity of the proposed 


random-interleaved design would likely be lower due to a lower 
dimension decoding matrix obtained from separate-layer decod¬ 
ing. In addition, the code structure and generating matrix of 
systematic standardized raptor code implementation has been 
highly optimized, including the decoding schedule in the code 
constraint processor |49l . To the authors’ best knowledge, such 
techniques have not been applied to EWF codes, which may also 
be complicated by their overlapping structure. 

Fig. [6]shows source reconstruction quality, in terms of PSNR, 
of the proposed random interleaved and EWF schemes for the 
best-effort QoS formulations of Problems 2 and 3. Transmis¬ 
sion of H.264 SVC coded CIF Stefan video sequence ll34ll is 
performed in two layers, with the first (base) layer containing 
Si = 400 symbols and all enhancement streams comprising the 
second layer with S 2 = 3400 symbols. Successfully decoding 



Fig. 6. Average PSNR performance of UEP schemes. LT codes with iterative 
decoding. L = 2; S = [400, 3400]; <5 = [0.55,1]; P = [0.95,0.8]; e max = 
1 ;w= [0.5, 0.5]. 


the first layer provides a PSNR of 25.79 dB while decoding both 
layers provides a PSNR of 40.28 dB. Performance is plotted as 
average PSNR versus selection probability p\ for the proposed 
random interleaved scheme and the first window selection prob¬ 
ability Ti of the EWF code. Given p\ or L|, average PSNR 
is obtained numerically by setting £ = £ max and substituting 
the corresponding decoding failure probabilities p e (i, j) into Q 
and ®. It should be noted that selection probabilities p\ and 
k 1 for the two different schemes are not directly comparable. 
For the cross-marked and star-marked curves, we have used the 
LT code with an iterative decoder and degree distribution tt r (x) 
applied to all windows and layers. For these parameters, both 
the proposed random interleaved and EWF schemes provide a 
maximum average PSNR of around 32.4 dB. For Problem 3, 
the feasible regions of selection probabilities p\ and I 1 are ob¬ 
tained by checking constraints (flOt . We note that for Problem 3, 
the maximum achievable average PSNRs remain the same since 
both optimal operating points of the proposed UEP scheme and 
the EWF code lie inside the feasible regions. The diamond- 
marked curve shows the results when standardized raptor codes 
are employed for the proposed random interleaved UEP scheme. 
A maximum average PSNR of 40.28 dB can be achieved for 
0.11 < pi < 0.18, which, as expected, is significantly higher 
than the other two LT coded curves. We can also observe from 
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Fig. [ 6 ] that different choices of p\ result in significant differ¬ 
ences in average PSNR, showing the need for optimization. Fi¬ 
nally, we observe that the steep performance curve of standard¬ 
ized raptor codes results in only two obtained PSNR values. 

The above LT coding / iterative decoding results are obtained 
using and-or tree analysis which assumes infinite block length. 
As a check, simulation of LT codes with degree distribution 
£l r (x) and iterative belief propagation (BP) decoding ll39l lfl4l 
are provided in Fig [7| Layer selection parameter p\ = 0.19 
obtained from Eqs. (Il2l) and ( 1 1 31 ) determine the constraints in 
Problem 1.0. The horizontal axis depicting transmission over¬ 
head includes the minimum overhead achieved by and-or tree 
analysis ( £ a nd-or = 1.475), as well as 5% (1.525) and 10% 
(1.575) greater than the minimum. The resulting PSNR for each 
user class is computed for each realization. The vertical axis 
shows the relative frequency that the PSNR is larger than the de¬ 
sired threshold (Prob (PSNRj > 7 j)) for each of the two user 
classes. It can be seen from the left side of Fig. |7]that the simu¬ 
lation results closely match the and-or tree analysis. The prob¬ 
ability of reaching target PSNR 7 j in simulation is very close 
to the desired probability threshold Pj . Also, by increasing the 
overhead to 1.525, a higher probability in reaching target QoS 
can be obtained. 



Fig. 7. Outage probability comparison, simulation results versus analysis for 
given desired thresholds, 

L = 2; 5 = [1000, 8000]; 6 = [0.4, 0.8]; P = [0.95, 0.8]; pi = 0.19. 


VI. Conclusions 

A randomly interleaved rateless coder for scalable multi- 
media multicasting systems with heterogeneous users is opti¬ 
mized for guaranteed and best-effort QoS. The resulting de¬ 
sign achieves unequal error protection. Further, guaranteeing 
QoS is shown to be a convex optimization problem, which can 
be solved analytically in practical scenarios. Numerical results 
show the transmission overhead required for the optimized pro¬ 
posed UEP rateless codes to be significantly less than that for 
EEP design and at least as low as recent optimized EWF and 
non-uniform-selection UEP rateless code designs. Significant 
gains for the proposed UEP scheme can be obtained by employ¬ 
ing standardized raptor codes. For example, in the best-effort 
QoS example in Fig. [ 6 ] the maximum achievable average PSNR 
using the proposed design employing standardized raptor codes 
is about 8 dB higher than that of either the proposed or EWF 
designs based on LT codes with iterative decoding. 


Appendix 


We prove the last part of Lemma 1, i.e., after performing 
Algorithm 1, transmission overhead cannot be further reduced 
with additional layer partitioning and selection probability re¬ 
assignment. Let Scheme A denote the source-to-channel layer 
mapping produced by Algorithm 1 and denote Scheme B as one 
which further partitions Layer l into Layers m and n with di¬ 
mensions S m and S n , respectively. Denote the resulting opti¬ 
mal selection probabilities for Scheme B which minimize the 
transmission overhead as p m and p n for Layers to and n, re¬ 
spectively. We now show that the minimum required trans¬ 
mission overhead is no larger by using Scheme A with se¬ 
lection probability pi = p m + p n assigned to Layer l. For 
the same number of total transmitted symbols M, the effec¬ 
tive average raptor code rates for Layer l in Scheme A, Layer 
to in Scheme B and Layer n in Scheme B are lit = 

R rn = and R n = -jfy -, respectively. Without loss of 


generality, we assume p m /S m > p n /S n . Then it can be shown 


that Ri = 


s m +s n 


< 


^Pm + ^Pn 


Sn 

Mpn 


= R n . As 


M(pm+Pn) — M(p m +p„) 

the decoding failure probability of the raptor codes is monoton- 
ically increasing with code rate for the same user class, we have 
(1 -Pe{l,j)) > (1 -P e (n,j)) > (1 — P e {m,j))(l — P e (n,j)) 
for any class index j, where P e (.) is the same decoding fail¬ 
ure probability function as defined in (j4j. This means that for 
the same number of transmitted symbols, the original mapping 
scheme (Scheme A) has higher probability of successfully de¬ 
coding all the symbols in Layer l than Scheme B for all user 
classes. Therefore, for the same QoS constraints described by 
©, Scheme A requires less minimum transmission overhead 
compared to Scheme B. Finally, raptor codes with larger dimen¬ 
sion have better performance for the same code rate, which also 
implies no further layer partitioning. 
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